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(54) Title: AUTO-DETECTION OF LIMITING FACTORS IN A TCP CONNECTION 
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THE SYSTEM 1 00 IS READY TO BEGIN AUTO- 
DIAGNOSIS AT THE TCP RECEIVER 110. 



A SET OF EVENT STATISTICS IS DEFINED. THE 
DEFINITION OF THE SET OF EVENT STATISTICS 
DETERMINES WHAT EVENTS WIU BE MONrTORED ^ 
AND WHAT TIME GRANUlARnY WILL BE USED TO 
MONITOR THOSE EVENTS. 



THE INCOMING PACKETS THAT ARE INSERTED 
INTO THE TCP RE-ASSEMBLY QUEUE ARE TAGGED 
WnH RESPECT TO THE TIME THE PACKET 
ARRIVES AT THE TCP RECEIVER 110 AND THE 
ORDER !N WHICH THE PACKET ARRIVED. 



THE TAGS ARE EVALUATED SO AS TO IDENTIFY 
PACKETS THAT WERE RECEIVED OUT OF ORDER. 
OUT-OF-ORDER PACKETS ARE FLAGGED. 



A PARTICULAR SET OF PACKETS ARRIVING WITHIN 
A PARTICULAR TIME FRAME IS IDENTIFIED SUCH 

THAT (1) NONE OF THE PACKETS IN THE SET 
CONTAINS CONTROL INFORMATION. META-DATA 
OR OTHER TYPES OF DATA THAT REQUIRE THE 
USE OF SMALL PACKETS AND (2) ALL OF THE 
PACKETS IN THE SET ARE PART OF THE SAME 
BURST OF DATA PACKETS. 



THE AVERAGE SIZE OF THE PACKETS INCLUDED 
IN THE SET OF PACKETS IDENTIFIED IN STEP 225 
IS CALCULATED. IF A PREDEFINED NUMBER OF 
PACKETS IN THE SET IS INAPPROPRIATELY SMALL. 
A FLAG IS SET. 



THE AMOUNT OF FREE SPACE INCLUDED IN A 
BUFFER COUPLED TO THE TCP RECEIVER 110 IS 
MONFTDRED OVER A PRE-OEFINED PERIOD OF TIME 
AND COMPARED TO THE RATE AT WHICH PACKETS 
ARE RECEIVED DURING THAT SAME TIME PERIOD. 

A FLAG IS SET IF THE AMOUNT OF FREE SPACE 
FAUS BELOW A PARTICULAR PRESET THRESHOLD 
DURING TIMES AT WHICH PACKETS ARRIVE ATA 
RELATIVELY CONSTAJ^ RATE. 



TO FIG. 2B 
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(57) Abstract: This invention comprises auto-diagnosis logic that can 
be implemented in operating systems in an appliance -like auto-diagnosis 
module coupled to the TCP receiver, the TCP sender or both. TCP events 
are sampled and a set of statistics on these events is maintained. Receiver 
side TCP diagnostic techniques include detecting sender's re-transmis- 
sion timeouts, evaluating the average size of packets being received, de- 
termining if a receiver is a bottleneck,and performing other evaluations of 
an incoming data stream. Sender side diagnostic techniques include flag- 
■ ging transmission timeouts, monitoring the average size of a transmitted 
packet, evaluating if the advertised window accounts for the delay-band- 
width products of the network connecting the receiver and the sender sys- 
tems, performing bottleneck checks, and other evaluations of an outgoing 
data stream. The results are aggregated using system attributes. Systems 
with conmion problem areas and attributes. Systems with common prob- 
lem areas and attributes are grouped together. ThePCT auto-diagnosis 
logic can be performed on-line ir iff-line. 
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For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
f^ing of each regular issue of the PCT Gazette. 
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AUTO-DETECTION OF LIMITING FACTORS IN A TCP CONNECTION 



Background of the Invention 

5 L Field of the Invention 

The invention relates to diagnosing Transmission Control Protocol 
(TCP) performance problem situations. 

10 2. Related Art 

TCP is one of the most widely used transport protocols. Used both in 
the Intemet and in maay intranets, TCP is used for HTTP (Hypertext Transfer 
Protocol) and CIFTS (Common Intemet File System) traffic, as well as NFS 

15 (Network File System) data. Although TCP is a robust protocol that provides reliable 
connection-oriented commimication over a wide variety of networks and at a variety 
of speeds, the observed rate of data transfer may be less than anticipated because (1) 
either data packet receivers or data packet senders or both may be poorly configured 
or overloaded, (2) a network or a portion thereof may lack sufficient bandwidth (for 

20 example, a network that runs at gigabit rates on the fringes may include a megabit 
link somewhere in the path between the data sender and the data receiver), (3) 
multiple data packet losses may occur (due to congestion or other reasons) and 
require course grained re-transmission timeouts or (4) other related causes. 

25 Since most TCP implementations are not designed for easy debugging 

of problems, various techniques have been designed and implemented to diagnose 
TCP-related problems. A first technique involves using some type of packet capture 
mechanism, such as the Berkeley Packet Filters and manual expert analysis of 
captured low level packet traces so as to isolate abnormal protocol behavior and trace 

30 it to misconfigured or overloaded elements in a network path. Although this 
technique permits the analysis of specific transmissions, it is relatively inconvenient, 

1 
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V 

costly and error prone. A variant of this technique is embodied in a tool developed 
by LBL researchers called tcpanaly. Tcpanaly automatically analyzes a TCP 
implementation's behavior by inspecting packet traces of TCP activity using packet 
filter traces. If a trace is found inconsistent with the TCP specification, tcpanaly may 
5 provide a diagnosis (if possible) or an indication of what specific activity is aberrant. 
Similar to other packet driven systems, Tcpanaly does not focus on the general 
dynamic behavior of a network, but rather on detecting packet filter measurement 
errors, and other low-level details of TCP algorithms to handle comer conditions 
while performing congestion control and dealing with various forms of packet loss. 

10 

Other techniques, such as commercial packet sniffer systems include 
logic that analyzes aggregate TCP statistics of the kind reported by the UNIX netstat 
command; unfortunately, these other similar techniques are generally limited to a 
broad analysis of the total connections that the system has ever seen. As such, they 
15 are not usefiil means to detect or diagnose a particular defect in a specific connection 
between two systems. 

Accordingly, it would be desirable to provide a method and system for 
detecting and diagnosing TCP-related problems. This method is achieved in an 
20 embodiment of the invention in which an appliance system including auto-diagnosis 
logic can be coupled to a network and implement auto-diagnostic techniques for TCP. 

Smnmarv of the Invention 

25 The invention provides a method and system for detecting and 

analyzing performance defects in the dynamic operation of the TCP protocol. In a 
preferred embodiment, this invention comprises auto-diagnosis logic that can either 
be implemented in a variety of operating systems (such as Data ONTAP) in an 
appliance-like auto-diagnosis module that is coupled to the TCP receiver, the TCP 

30 sender or both. 



2 
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• In a first aspect of the invention, TCP events are sampled and a 

carefully maintained set of statistics on these events is maintained. The granularity of 
the sampling and the time period sampled may be adjusted so as to meet the 
requirements of a particular system. These statistics can be used in the diagnosis of 
5 defects on either the sender side or the receiver side, or both. 

Receiver side TCP diagnostic techniques include (1) detecting sender's 
re-transmission timeouts, (2) evaluating the average size of packets being received, 
(3) determining that the receiver does not act as a computational or protocol 
10 bottleneck, and (4) performing other statistical evaluations of an incoming data 
stream. 

Sender side diagnostic techniques include (1) flagging excessive 
transmission timeouts, (2) monitoring the average size of a transmitted packet, (3) 
15 evaluating if the advertised window is large enough to account for the delay- 
bandwidth product of the network connecting the receiver and the sender systems, (4) 
performing various bottleneck checks, and (5) performing other statistical evaluations 
of an outgoing data stream. 

20 In a second aspect of the invention, the results of the auto-diagnosis are 

aggregated using a database that includes known attributes of cUent systems. 
Examples of attributes include EP subnet number, OS type/version, last configuration 
change date, delay distance, route information, virtual LAN information historical 
summary of auto-diagnosis information and other attributes such as may be useftil in 

25 aggregating auto-diagnosis information. Client systems with common problem areas 
and common attributes are grouped together for presentation to a system 
administrator. 

In a preferred embodiment, the TCP auto-diagnosis logic can be 
30 performed on-line or off-line. Although this auto-diagnosis logic is relatively non- 
dismptive, for any reasonable implementation, this feature permits performance of 

3 
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system analysis at non-critical times, for example when the overall demand for 
computing resources may be relatively low. 



Brief Description of the Drawin) gs 

5 

Figure 1 shows a block diagram of a system for auto-detection of 
performance limitmg factors in a TCP connection 

Figures 2A and 2B show a process flow diagram of a method of auto- 
10 detection of performance limiting factors in a TCP receiver. 

Figures 3A and 3B show a process flow diagram of a method of auto- 
detection of performance limiting factors in a TCP sender. 

15 Detailed Description of the Preferred Embodiment 

In the following description, a preferred embodiment of the invention is 
described with regard to preferred process steps and data structures. However, those 
skilled in the art would recognize, after perusal of this application, that embodiments 
20 of the invention may be miplemented using one or more general purpose processors 
(or special purpose processors adapted to the particular process steps and data 
structures) operating under program control, and that unplementation of the preferred 
process steps and data structures described herein using such equipment would not 
require undue experimentation or further invention. 

25 

System Elements 

Figure 1 shows a block diagram of a system for auto-detection of 
limiting factors in a TCP connection. 

30 
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A system 100 includes a plurality of TCP receivers 110 and TCP 
senders 120 coupled to a communications network 130. At least one of the TCP 
senders 110 or TCP receivers 120 includes an operating system 140 and auto- 
diagnosis module 150. 

5 

The TCP receivers 110 and TCP senders 120 may comprise a general- 
purpose computer, such as a client workstation, or server or other information 
appHance falling within the generalized Turing paradigm. Although these devices are 
categorized as TCP receivers 110 or TCP senders 120, this distinction is based upon 
10 the relation between one device and other. A device that is characterized as a TCP 
receiver 110 for a particular data packet may be characterized as a TCP sender 120 
for another data packet. 

The communication network 130 is disposed for communicating data 
15 between the TCP receivers 1 10 and the TCP senders 120. In a preferred embodiment, 
the communication network 130 includes a packet switched network such as the 
Intemet, as well as (in conjunction with or instead of) an intranet, an enterprise 
network, an extranet, a virtual private network or a virtual switched network. In 
altemative embodiments, the communication network 130 may include any other set 
20 of conununication links that couple the TCP receivers 1 10 and TCP senders 120. 

The auto-diagnosis module 150 includes a set of software instructions 

155 that may be coupled to an operating system 140 (preferably Data ONTAP). In 

« 

other embodiments, the autodiagnosis module 150 may take the form of a separate 
25 appUance that can be coupled to either a TCP receiver 1 10 or a TCP sender 120, or 
both. These software instructions 155 include logical techniques to diagnose 
problematic conditions on both the TCP receiver 110 and the TCP sender 120, In 
addition to these logical techniques, the software instructions 155 also include (1) 
instructions to perform various determination procedures (described infira). (2) 
30 instructions to remedy particular problematic conditions, and (3) instmctions to 
aggregate information for presentation to a system administrator. 
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Software instructions 155 that can be implemented on the TCP receiver 
110 side include: 

• Instructions to detect and evaluate the TCP sender 120 re-transmission time-outs 
The set of instructions 155 evaluates re-transmission timeouts occmring at the 
TCP sender 120. The set of instructions 155 includes an instruction to tag each 
received packet that is inserted iuto the TCP re-assembly queue. It is possible to 
distinguish betvi^een quiet periods that occur in '^bursty" traffic and re- 
transmission timeouts because the instructions evaluate both (1) the time at which 
a data packet arrives on each TCP receiver 1 10 and (2) the sequence number in 
which Ihe data packets arrive. A number of out-of-sequence packets with a 
particular inter-packet arrival gap (typically 500nis) are indicative of a 
retransmission timeout at the TCP sender. 

• Average packet size 

The instruction set 155 includes an instruction to determine the size of incoming 
data packets and determine the arrival time of the incoming packet so as to 
determine if the TCP sender 120 is using sufficientiy large frames. The 
instruction set 155 eliminates confounding information such as control 
information, or meta-data that are necessarily communicated using small TCP 
packets by determining the size of those incoming packets that are part of a burst 
of back-to-back packets. This determination is not performed upon incoming 
packets that are not part of such a burst. In the event that the average packet size 
is inappropriately small (such as may occur if an application is poorly written or a 
network is misconfigured), the instruction set 155 may provide for remedial steps. 

• Botdeneck checks 

The mstruction set 155 includes three related subsets of instructions that 
determine whether tiie TCP receiver 110 creates a bottleneck in data transfer. A 
first inshnction subset involves monitoring the amount of fi«e space available in a 
buffer coupled to the TCP receiver 110. A second instruction subset involves 
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monitoring the amount of receiver CPU utilized for protocol processing and 
comparing the amount of CPU cycles that are consumed by TCP processing to the 
amount left over as idle time. A third instruction subset involves monitoring the 
rate at which the receiver application reads from the TCP socket receiver buffers. 
5 It is unlikely that a TCP receiver 110 is acting as a computational or protocol 
bottleneck if (1) packets arrive continually at a roughly constant rate without the 
amount of jfree space shrinking substantially, (2) the TCP receiver 110 is not 
running out of CPU cycles to perform protocol processing or (3) the rate at which 
receiver appUcation reads from the buffers does not fall below a preset threshold. 

10 

Software instructions 155 that can be implemented on the TCP sender 
120 side include: 

• Flagging excessive re-transmission timeouts 

15 The instruction set 155 includes instractions to (1) deteraiine when a coarse- 
grained re-transmission timer expires and a packet is re-transmitted, (2) determine 
if the number of re-transmission timeouts on any TCP connection exceeds a 
particular preset threshold and (3) notify a system manager in the event that this 
threshold is exceeded. 

20 

• Average packet size 

The instruction set 155 includes mstructions to monitor the average size of 
transnaitted data packets sent by a TCP sender 120. Similar to packet size 
monitoring performed on the TCP receiver 110, limiting the analysis of average 
25 packet size to packets included in a back-to-back packet train eliminates a number 
of factors that would skew the data, such as small packets that carry control 
information. 

• Receiver window monitoring 

30 The instruction set 155 on the TCP sender 120 monitors the window advertised by 
the TCP receiver 1 10, Specific instructions cause the TCP sender 120 to check if 

7 
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the advertised window is large enough to account for the delay bandwidth product 
of the path between sender and receiver. The instruction set 155 also controls 
monitoring of the variation in the TCP receiver's advertised window over time. If 
the advertised window remains relatively constant, then it is not likely that the 
TCP receiver 11 0 is acting as a bottleneck. 

• Bottleneck checks 

Similar to the TCP receiver 110, the instruction set 155 includes various 
instruction subsets to insure that the TCP sender 120 is not a computational or 
protocol bottleneck. The first instruction subset involves comparing the size of a 
buffer coupled to the TCP sender 120 with the full bandwidth of the network path 
between the TCP sender 120 and TCP receiver 110. If the buffer size is 
appropriate, then absent other indications such as described below, it is unlikely 
that the TCP sender is acting as a bottleneck. A second instruction subset 
ittvolving monitoring the unsent, queued data in the send buffer. A TCP sender 
120 may be acting as a bottieneck if there are periods of time in which the TCP 
sender 120 is unable to send anything. Lastly, a third instruction subset monitors 
whether the TCP sender 120 can perform protocol processing sufficiently fast so 
as to maintain pace with the data consumption of the TCP receiver 110. If the 
protocol processing is performed at a slow rate with respect to the TCP receiver 
1 10, the TCP sender may be acting as a botdeneck. 

The instructions to perform various determination procedures include 
measurement of offline delay calculation by using a periodic ICMP (Internet Control 
Message protocol) ping and measuring the roundrip time, TCP fingerprinting and 
making various assumptions regarding bandwidth, MTU size and RTO estunates. 
These various determination procedures may be used to perform other types of 
instruction described supra. 

In addition to the software instructions 155, the auto-diagnosis module 
also includes a database 157. The database 157 includes a set of fields 158 describing 

8 
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' various attributes of known client systems such as IP subnet number, OS 
type/version, last configuration change date, delay distance, route information virtual 
LAN information, historical simmniary auto-diagnosis information and other such 
attributes as may be useful when aggregating diagnosis information and presenting it 
5 to a system administration. Each field included in the set of fields 158 is assigned an 
appropriate weight. This weighted value is used when auto-diagnosis results are 
agg^^eigated. 

Figures 2A and 2B show a process flow diagram of a method of auto- 
10 detection of limiting factors in a TCP receiver. 

A method 200 auto-diagnosis is performed by an embodiment of the 
system 100 in which the auto-diagnosis module 150 is coupled to the TCP receiver 
110 side. Although the auto-diagnosis method 200 is described serially, the steps of 
15 the method 200 can be performed by separate elements in conjunction or in parallel, 
whether asynchronously, in a pipelined manner, or otherwise. There is no particular 
requirement that the method 200 be performed in the same order in which this 
description lists the steps, except where so indicated. 

20 At a flow point 205, the system 100 is ready to begin auto-diagnosis at 

the TCP receiver 110. 



In a step 210, a set of event statistics is defined. The definition of the 
set of event statistics determines what events will be monitored and what time 
25 granularity will be used to monitor those events. In a preferred embodiment, it is 
possible to identify particular events and particular granularity so as to tailor the auto- 
diagnosis to a particular system. 



In a step 215, incoming packets that are inserted into the TCP re- 
30 assembly queue are tagged with respect to the time the packet arrives at the TCP 
receiver 110 and the order in which the packet arrives. 



9 
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In a step 220, the tags are evaluated so as to identify packets that are 
received out-of-order. If a packet is identified as arriving out-of-order, and the inter- 
packet arrival time is above a threshold value, the event is flagged as indicative of a 
TCP sender 120 re-transmission timeout. An excessive number of such re- 
transmission timeouts will be deemed as indicative of a performance problem due to 
some problem in the network path between the sender and the receiver and may be 
reported to the system administrator. Remedial procedures may be suggested to the 
administrator or implemented, if possible. 

In a step 225, a particular set of packets arriving within a particular time 
frame is identified such that (1) none of the packets in the set contains control 
information, meta-data or other types of data that require the use of small packets and 
(2) all of the packets in the set are part of the same burst of data packets. A "burst" is 
defined as a set of packets that are transmitted back-to-back by the TCP sender. 

In a step 230, the average size of the packets included in die set of 
packets identified in step 225 is calculated. If a predefined number of packets in the 
set is inappropriately small, a flag is set indicating that the TCP appUcation may be 
poorly written or the network may be misconfigured. Remedial procedures may be 
suggested to the system administrator or implemented. 

In a step 235, the amount of fi-ee space included in a buffer coupled to 
the TCP receiver 1 10 is monitored over a pre-defined period of time and compared to 
the rate at which packets are received during that same time period. A flag is set if 
the amount of free space falls below a particular preset threshold during times at 
which packets arrive at a relatively constant rate. This will be deemed indicative of a 
situation where the receiver is the botfleneck in data transfer. Remedial procedures 
may be suggested to the system's administrator or implemented, if possible. 



10 
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In a step 240, the number of CPU cycles utilized for receiver TCP 110 
processing is compared to the amount left over as idle time. If the mmiber of CPU 
cycles exceeds a pre-defined threshold, a flag is set. This situation is deemed 
indicative of the receiver system being the bottleneck in data transfer. Remedial 
5 procedures may be automatically implemented, if possible. 

In a step 245, the rate at which the receiver application reads from the 
TCP socket receiver buffers is monitored. If the rate falls below a pre-defined 
threshold, a flag is set. If a flag is set, then remedial procedures may be automatically 
10 implemented, if possible. 

In a step 250, post-diagnostic aggregation is performed upon the events 
that have been monitored so that appropriate types and appropriate amounts of data 
may be presented to a system administrator. This post-diagnostic aggregation 

15 includes identifying some or all of the following: (1) groups of client systems that are 
appropriate for aggregation, (2) a set of one or more attribute types (such as IP subnet 
number, OS type/version, last configuration change and other factors) that may be 
particularly relevant, and (3) sets of problems that occurred within these client 
systems. Aiter determining which client systems are appropriate for aggregation, 

20 systems that share a common problem are identified. This is done by looking to the 
natm-e and weight given to the various attributes as stored in database 157. 

In a step 255, a final report is generated based upon (1) the post- 
diagnostic aggregation and (2) a description of individual problems that were not 
25 aggregated. 

In a step 260, the final report is transmitted to a system administrator. 

In a step 265, a record of diagnostic conclusions, remedial procedures, 
30 messages to system administrators and other information relating to the performance 
of a method 200 is stored in the database 157. 



11 
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Figures 3A and 3B show a process flow diagram of a method of auto- 
detection of limiting factors in a TCP sender. 

A method 300 for auto-diagnosis is performed by an embodiment of the 
system 100 in which the auto-diagnosis module 150 is coupled to the TCP sender 120 
side. Although the auto-diagnosis method 300 is described serially, the steps of the 
method 300 can be performed by separate elements ia conjunction or in parallel, 
whether asynchronously, in a pipelined manner, or otherwise. There is no particular 
requirement that the method 300 be performed m the same order in which this 
description lists the steps, except where so indicated. 

At a flow point 305, the system 100 is ready to begin auto-diagnosis at 
the TCP sender 120. 

In a step 310, a course grained re-transmission timer is monitored to 
determine when it expires and a packet is re-transmitted. If the number of re- 
transmission timeouts exceeds a particular threshold, a flag is set. This situation is 
deemed indicative of some problem in the network path between the sender and the 
receiver. Remedial action may be implemented, if possible. 

At a step 315, a particular set of packets sent within a particular time 
frame is identified such that (1) none of the packets m the set contains control 
information, meta-data or other types of data that require the use of small packets and 
(2) all of the packets in the set are part of the same burst of data packets. 

In a step 320, the average size of the packets included in the set of 
packets identified in step 3 15 is calculated. If a predefined number of packets in the 
set is inappropriately small, a flag is set indicating that the TCP application may be 
poorly written or the network may be misconfigured. Remedial procedures may be 
implemented, if possible. 
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In a step 325, the size of the window advertised by the TCP receiver 
110 is monitored. 

In a step 330, a delay bandwidth product is calculated. 

In a step 335, the size of the advertised window and the delay 
bandwidth product are compared. If the size of the window is not large enough to 
account for the delay-bandwidth product, this may indicate that the TCP receiver 110 
is acting as bottleneck. Under these conditions, a flag is set, and remedial steps may 
be taken if possible. 

In a step 340, variations in the size of the advertised window on the 
TCP receiver 1 10 are compared to the performance of the TCP sender 120. If the 
15 size variations fit a predefined pattem, the TCP sender 120 may be acting as a 
bottleneck. Under these conditions, a flag is set and remedial steps may be taken, if 
possible. 

In a step 345, the size of the TCP sender 120 buffer is compared to the 
20 bandwidth of the network path between the TCP sender 120 and TCP receiver 110. If 
the send buffer size is inadequate in relation to the bandwidth of the network path, 
then the TCP sender 120 may be a bottleneck. Under these conditions, a flag is set 
and remedial steps may be taken if possible. 

25 In a step 350, the amount of unsent, queued data in the send buffer on 

each TCP sender 120 is monitored. If the number of times that the TCP sender 120 is 
unable to send anything on a connection exceeds a particular threshold, then the TCP 
sender 120 may be a bottieneck. Under these conditions a flag is set and remedial 
steps may be taken if possible. 

30 
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In a step 355, post-diagnostic aggregation is performed upon the events 
that have been monitored. This step is similar to step 250 in the Method 200. This 
post-diagnostic aggregation includes identifying (1) groups of client systems that are 
appropriate for aggregation. (2) a set of one or more attribute types (such as IP subnet 
number, OS type/version, last conjBguration change and other factore) that may be 
particularly relevant or (2) sets of problems that occurred within these client systems. 
After determining which client systems are appropriate for aggregation, systems that 
share a common problem are identified by looking to the weight of the various 
attributes stored in database 1 57. 

In a step 360, a fmal report is generated based upon (1) the post- 
diagnostic aggregation and (2) a description of individual problems that were not 
aggregated. 

In a step 365, the final report is transmitted to a system administrator. 

In a step 370, a record of diagnostic conclusions, remedial procedures, 
messages to system administrators and other information relating to the performance 
of a method 300 is stored in the database 157. 

Alternative Embodiments 

Although preferred embodiments are disclosed herein, many variations 
are possible which remain within the concept, scope, and spirit of the invention, and 
these variations would become clear to those skilled in the art after perusal of this 
application. 
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Claims 

1 . An apparatus, including 

at least one first computing device, including a processor and a 
5 memory, wherein said first computing device is a TCP receiver; 

at least one second computing device, including a processor and a 
memory; wherein said second computing device is a TCP sender, 

a communications network coupled to said first computing device and 
said second computing device; 
10 a set of instructions coupled to said memory associated with said first 

computing device, wherein said set of instructions includes instructions for (1) 
defining a set of data concerning TCP operations, (2) evaluating said set of data 
concerning TCP operation; and (3) generating a result of said evaluating; and 

a database coupled to said memory associated with said first computing 

15 device. 



20 



2. An apparatus as in claim 1, wherein said instructions for 
defining a set of data concerning TCP operations includes defining a set of events to 
monitor and a time granularity in which said events will be monitored. 

3. An apparatus as in claim 1, wherein said set of instructions for 
evaluating said set of data concerning TCP operation includes a subset of instructions 
for detecting retransmission timeouts at a TCP receiver. 



25 4. An apparatus as in claim 3, wherein said subset of instructions 

for detecting retransmission timeouts includes 

an instruction to tag each packet that is inserted into a TCP reassembly 
queue with a timestamp and a sequence niunber; 

an instmction to read each tag and sort so as to identify said tags that 
30 are not in sequential order; 
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an instruction to generate a flag for every said tag that is not in 
sequential order; and 

store said flags in said memory. 

5 . An apparatus as in claim 1 , wherein said set of instructions for 
evaluating said set of data concerning TCP operation includes a subset of instructions 
for evaluating the average packet size of incoming packets. 

6. An apparatus as in claim 5, wherein said subset of instructions 
for evaluating the average packet size includes 

an instruction to identify packets that are included in burst of back-to- 
back packets; 

an instruction to evaluate the size of said identified packets; 
an instruction to generate a flag if any packet included in said burst of 
back-to-back packets falls below a pre-set threshold; and 

an instruction to store said flags in said memory. 

7. An apparatus as in claim 1, wherein said set of instructions for 
evaluating said set of data concerning TCP operation includes at least three subsets of 
instructions for performing a bottleneck check at a receiver TCP. 

8. An apparatus as in claim 7, wherein one of said at least three 
subsets of instructions includes 

an instruction to monitor the advertised window size in said Gist 
computing device; 

an instruction to monitor the rate at which packets arrive at said first 
computing device; 

an instruction to generate a flag is the size of said advertised window 
falls below a certain threshold while said rate at which packets arrive remains within 
a predefined range; and 

an instruction to store said flags in said memory. 
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4 

9. An apparatus as in claim 7, wherein one of said at least three 
subsets of instructions includes 

an instruction to monitor the number of cycles associated with said 
5 processor included in said first computing device, that are utiUzed for processing of 
TCP protocol; 

an instruction to monitor the amount of said processor associated with 
said first computing device that is characterized as idle time; 

an instruction to compare the result of monitoring the amoimt of said 
10 processor that is utilized for TCP processing with said result of monitoring the 
amoimt of said processor that is utilized for processing of TCP protocol; 

an instruction to generate a flag if the ratio between the amount of said 
processor that is utilized for TCP processing and the amount of said processor that is 
utilized for processing of TCP protocol falls below a pre-set threshold;and 
15 an instruction to store said flag in said memory. 

10. An apparatus as in claim 7, wherein one of said at least three 
subsets of instructions includes 

an instruction to monitor the rate at which said first computing device 
20 reads firom the TCP buffers; 

an instruction to compare said rate at which said first computing 
devices reads to a standard rate; 

an instruction to generate a flag if said rate at which first computing 
device reads exceeds said standard rate by a predetermined value; and 
25 an instruction to store said flag in said memory. 

11. An apparatus as m claim 1 , wherein said set of instructions also 
includes a subset of instruction to aggregate said result. 
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12. An apparatus as in claim 1 1, wherein said subset of instructions ' 
to aggregate includes an instruction to aggregate based upon information stored in 
said database. 

13. An instruction as in claim 1 1 , wherein said subset of instructions 
to aggregate include an instruction to aggregate based upon said result. 

14. An apparatus, including 

at least one first computing device, including a processor and a 
memory, wherein said first computing device is a TCP sender; 

at least one second computing device, including a processor and a 
memory; wherein said second computuig device is a TCP receiver; 

a communications network coupled to said first computing device and 
said second computing device; 

a set of instructions coupled to said memory associated with said 
second computing device, wherein said set of instructions includes instructions for (1) 
defining a set of data concerning TCP operations, (2) evaluating said set of data 
concerning TCP operation; and (3) generating a result of said evaluating; and 

a database coupled to said memory associated with said first computing 

device. 



15. An apparatus as in claim 14, wherem said instructions for 
defining a set of data concerning TCP operations includes defining a set of events to 
monitor and a time granularity in which said events will be monitored. 

16. An apparatus as in claim 1 4, wherem said set of instructions for 
evaluating said set of data concerning TCP operation includes a subset of instructions 
for detecting retransmission timeouts at said first computing device. 

17. An apparatus as in claim 1 6, wherein said subset of instmctions 
for detecting retransmission timeouts includes 
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an instruction to determine when a re-transmission time expires and 
packets are retransmitted; 

an instruction to measure the nmnber of re-transmissions occurred 
within a predetermined time period; 

an iostruction to generate a flag if said nimiber of re-transmissions 
exceeds a predetermined threshold; and 

an instruction to store said flags in said memory. 

18. An apparatus as in claim 14, wherein said set of instructions for 
evaluatiug said set of data concerning TCP operation includes a subset of instructions 
for evaluating the average packet size of outgoing packets. 

19. An apparatus as in claim 1 8, wherein said subset of instructions 
for evaluating the average packet size includes 

an instruction to identify packets that are included in burst of back-to- 
back packets; 

an iostruction to evaluate the size of said identified packets; 
an instruction to generate a flag if any packet included in said burst of 
back-to-back packets falls below a pre-set threshold; and 

an instruction to store said flags in said memory. 

20. An apparatus as in claim 14, wherein said set of instructions 
includes a subset of iostructions for monitoring the receiver window associated with 
said second computing device. 

21. An apparatus as in claim 20, wherein said subset of 
instructions for monitoring said receiver window includes 

an instruction to monitor the variation of space available in said 
receiver window over time; 

an instruction to generate a flag if said variation in space available in 
said window fits a particular predetermined pattem; and 
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an instruction to store said flags in said memory. 



22. An apparatus as in claim 14, wherein said set of instructions 
for evaluating said set of data concerning TCP operation includes at least three 
subsets of instructions for performing a bottleneck check at a sender TCP. 

23 . An apparatus as in claim 22, wherein one of said at least three 
subsets of instructions includes 

an instruction to monitor the size of a buffer associated with said first 
computing device; 

. an instruction to monitor the rate at which packets are transmitted; 

an instruction to generate a flag if the space available in said buffer falls 
below a certain threshold while said rate at which packets are transmitted remains 
within a predefined range; 

an instruction to store said flag in said memory. 

24. An apparatus as in claim 22, wherein one of said at least three 
subsets of instructions includes 

an instruction to monitor the amount of unsent queued data in a buffer 
associated Avith said first computing device; 

an instruction to count the number of times that the first computing 
device is unable to transmit said imsent queued data during a pre-set interval of time; 

an instruction to generate a flag if said number of times said first 
computing device is unable to transmit exceeds a pre-set threshold; and 

an instruction to store said flag in said memory. 

25. An apparatus as in claim 21 , wherein one of at least three subsets 
of instructions includes 

an instruction to monitor the rate that said first computing device 
performs protocol processing; 
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an instruction to monitor said the data consumption bandwidth of said 
second computing device; 

an instraction to compare the said rate of protocol processing with said 
data consumption bandwidth; 

an instraction to generate a flag is the ratio between said rate and said 
bandwidth exceeds a pre-set threshold; and 

an instraction to store said flag in said memory. 

26. An apparatus as in claim 14, wherein said set of instructions also 
includes instractions to aggregate said result. 

21. An apparatus as in claim 26, wherein said instractions to 
aggregate includes an instraction to aggregate based upon information stored in said 
database. 



28. An apparatus as in claim 26, wherein said instractions to 
aggregate include an instraction to aggregate based upon said result. 



29. A method, including steps of 

defining a set of data concerning TCP operations; 
evaluating the performance of a TCP receiver or a TCP sender; 
generating a set of data regarding said performance; and 
aggregating said set of data. 

30. A method as in claim 29, wherein said step of defining a set of 
data includes steps of 

defining a set of events to monitor; and 

defining a time granularity in which said set of events will be 

monitored. 
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31. A method as in claim 29, wherein said step of evaluating said ' 
perfonnance includes steps of 

detecting retransmission timeouts at a TCP receiver by looking to the 
time and order in which data packets are received 

evaluating the average size of incoming packets included in a burst of 
back-to-back packets; 

monitoring the advertised window size of said TCP receiver and 
comparing to the rate at which said data packets are received; 

monitoring the number of processor cycles used to perform TCP 
protocols and comparing them to the number of cycles characterized as idle time; and 

monitoring the rate at which said TCP receiver reads &om a buffer and 
comparing it to a standard rate. 

32. A method as in claim 29, wherein said step of evaluating said 
performance includes steps of 

detecting retransmission timeouts by monitoring when a re-transmission 
time expires and packets are re-transmitted by a TCP sender; 

evaluating the average size of outgoing packets by monitoring the size 
of said outgoing packets included in a burst of back-to-back out going packets; 

monitoring the size of a receiver window associated with a TCP 

receiver; 

monitoring patterns in the size of said receiver window; 

monitoring the size of a buffer associated with said TCP sender and 
comparing to the rate at which said packets are transmitted; 

monitoring liie amount of unsent queued data in said buffer associated 
with a TCP sender; and 

monitoring the rate at which protocol processing is performed and 
comparing it to data consumption bandwidth. 
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33. A method as in claim 29, wherein said step of aggregating 
includes aggregating based upon (1) attributes of a computing device and (2) results 
associated with said set of data. 
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THE SYSTEM 100 IS READY TO BEGIN AUTO- 
DIAGNOSIS AT THE TCP RECEIVER 110. 



A SET OF EVENT STATISTICS IS DEFINED. THE 
DEFINITION OF THE SET OF EVENT STATISTICS 
DETERMINES WHAT EVENTS WILL BE MONITORED 
AND WHAT TIME GRANULARITY WILL BE USED TO 
MONITOR THOSE EVENTS. 



THE INCOMING PACKETS THAT ARE INSERTED 
INTO THE TCP RE-ASSEMBLY QUEUE ARE TAGGED 
WITH RESPECT TO THE TIME THE PACKET 
ARRIVES AT THE TCP RECEIVER 110 AND THE 
ORDER IN WHICH THE PACKET ARRIVED. 







THE TAGS ARE EVALUATED SO AS TO IDENTIFY 
PACKETS THAT WERE RECEIVED OUT OF ORDER. 
OUT-OF-ORDER PACKETS ARE FLAGGED. 







A PARTICULAR SET OF PACKETS ARRIVING WITHIN 
A PARTICULAR TIME FRAME IS IDENTIFIED SUCH 

THAT (1) NONE OF THE PACKETS IN THE SET 
CONTAINS CONTROL INFORMATION, META-DATA 
OR OTHER TYPES OF DATA THAT REQUIRE THE 
USE OF SMALL PACKETS AND (2) ALL OF THE 
PACKETS IN THE SET ARE PART OF THE SAME 
BURST OF DATA PACKETS. 



THE AVERAGE SIZE OF THE PACKETS INCLUDED 
IN THE SET OF PACKETS IDENTIFIED IN STEP 225 
IS CALCULATED. IF A PREDEFINED NUMBER OF 
PACKETS IN THE SET IS INAPPROPRIATELY SMALL. 
A FLAG IS SET. 



THE AMOUNT OF FREE SPACE INCLUDED IN A 
BUFFER COUPLED TO THE TCP RECEIVER 110 IS 
MONITORED OVER A PRE-DEFINED PERIOD OF TIME 
AND COMPARED TO THE RATE AT WHICH PACKETS 
ARE RECEIVED DURING THAT SAME TIME PERIOD. 

A FLAG IS SET IF THE AMOUNT OF FREE SPACE 
FALLS BELOW A PARTICULAR PRESET THRESHOLD 
DURING TIMES AT WHICH PACKETS ARRIVE ATA 
RELATIVELY CONSTANT RATE. 



TO FIG. 2B 

FIG.2A 
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FROM FIG. 2A 



THE NUMBER OF CPU CYCLES UTILIZED FOR TCP 
PROCESSING IS COMPARED TO THE AMOUNT LEFT 

OVER AS IDLE TIME. IF THE NUMBER OF CPU 
CYCLES EXCEEDS A PRE-DEFINED THRESHOLD. 
A FLAG IS SET. 



THE RATE AT WHICH THE RECEIVER APPLICATION 
READS FROM THE TCP SOCKET RECEIVER BUFFERS 
IS MONITORED. IF THE RATE FALLS BELOW A PRE- 
DEFINED THRESHOLD, A FLAG IS SET. 



POST-DIAGNOSTIC AGGREGATION IS PERFORMED 
UPON THE EVENTS THAT HAVE BEEN MONITORED. 



A FINAL REPORT IS GENERATED BASED UPON (1) 
THE POST-DIAGNOSTIC AGGREGATION AND (2) A 
DESCRIPTION OF INDIVIDUAL PROBLEMS THAT 
WERE NOT AGGREGATED. 



THE FINAL REPORT IS TRANSMITTED 
TO A SYSTEM ADMINISTRATOR 



A RECORD OF DIAGNOSTIC CONCLUSIONS, 
REMEDIAL PROCEDURES, MESSAGES TO SYSTEM 

ADMINISTRATORS AND OTHER INFORMATION 
RELATING TO THE PERFORMANCE OF A METHOD 
200 IS STORED IN THE DATABASE 157. 



-240 



-245 



-250 



■255 



-260 



-265 



FIG. 2B 
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THE SYSTEM 100 IS READY TO BEGIN AUTO- 
DIAGNOSIS AT THE TCP SENDER 120. 



A COARSE-GRAINED RE-TRANSMISSION TIME IS 
MONITORED TO DETERMINE WHEN IT EXPIRES AND 
A PACKET IS RE-TRANSMITTED. IF THE NUMBER 
OF RE-TRANSMISSION TIMEOUTS EXCEEDS A 
PARTICULAR THRESHOLD. A FLAG IS SET. 



A PARTICULAR SET OF PACKETS SENT WITHIN A 
PARTICULAR TIME FRAME IS IDENTIFIED SUCH THAT 
(1) NONE OF THE PACKETS IN THE SET CONTAINS 

CONTROL INFORMATION, META-DATA OR OTHER 
TYPES OF DATA THAT REQUIRE THE USE OF SMALL 
PACKETS AND (2) ALL OF THE PACKETS IN THE SET 
ARE PART OF THE SAME BURST OF DATA PACKETS. 



■305 



310 



■315 



THE AVERAGE SIZE OF THE PACKETS INCLUDED 
IN THE SET OF PACKETS IDENTIFIED IN STEP 315 
IS CALCULATED. IF A PREDEFINED NUMBER OF 
PACKETS IN THE SET IS INAPPROPRIATELY SMALL, 

A FLAG IS SET INDICATING THAT THE TCP 
APPLICATION MAY BE POORLY WRITTEN OR THE 
NETWORK MAY BE MISCONFIGURED. 







THE SIZE OF THE WINDOW ADVERTISED BY THE 
TCP RECEIVER 110 IS MONITORED. 






A DELAY BANDWIDTH PRODUCT IS CALCULATED. 







THE SIZE OF THE ADVERTISED WINDOW AND THE 
DELAY BACKWIDTH PRODUCT ARE COMPARED. IF 
THE SIZE OF THE WINDOW IS NOT LARGE ENOUGH 
TO ACCOUNT FOR THE DELAY-BANDWIDTH 
PRODUCT, THIS MAY INDICATE THAT THE TCP 
RECEIVER 110 IS ACTING AS BOTTLENECK. UNDER 
THESE CONDITIONS, A FLAG IS SET. 



-320 



■325 



-330 



■335 



VARIATIONS IN THE SIZE OF THE ADVERTISED 
WINDOW ON THE TCP RECEIVER 110 ARE COMPARED 
TO THE PERFORMANCE OF THE TCP SENDER 120. 



-340 



TO FIG. 3B 



FIG. 3A 
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FROM FIG. 3A 



THE SIZE OF THE TCP SENDER 120 BUFFER IS 
COMPARED TO THE BANDWIDTH OF THE NETWORK 
PATH BETWEEN THE TCP SENDER 120 AND TCP 
RECEIVER 110. IF THE SEND BUFFER SIZE IS 
INADEQUATE IN RELATION TO THE BANDWIDTH OF 
THE NETWORK PATH. A FLAG IS SET. 



THE AMOUNT OF UNSENT, QUEUED DATA IN THE 

SEND BUFFER ON EACH TCP SENDER 120 IS 
MONITORED. IF THE NUMBER OF TIMES THAT THE 
TCP SENDER 120 IS UNABLE TO SEND ANYTHING 
ON A CONNECTION EXCEEDS A PARTICULAR 
THRESHOLD, THEN A FLAG IS SET. 



POST-DIAGNOSTIC AGGREGATION IS PERFORMED 
UPON THE EVENTS THAT HAVE BEEN MONITORED. 



■345 



350 
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A FINAL REPORT IS GENERATED BASED UPON (1) 
THE POST-DIAGNOSTIC AGGREGATION AND (2) A 
DESCRIPTION OF INDIVIDUAL PROBLEMS THAT 
WERE NOT AGGREGATED. 



THE FINAL REPORT IS TRANSMITTED 
TO A SYSTEM ADMINISTRATOR. 



A RECORD OF DIAGNOSTIC CONCLUSIONS, 
REMEDIAL PROCEDURES, MESSAGES TO SYSTEM 

ADMINISTRATORS AND OTHER INFORMATION 
RELATING TO THE PERFORMANCE OF A METHOD 
300 IS STORED IN THE DATABASE 157. 
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FIG. 3B 
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